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ABSTRACT 

Using deep 100 and 160 fim observations in GOODS-South from GOODS-ifer.se/ie/, combined with 
high resolution HST/ WFC3 near-infrared imaging from CANDELS, we present the first detailed mor- 
phological analysis of a complete, far-infrared (FIR) selected sample of 52 Ultraluminous Infrared 
Galaxies (ULIRGs: Lir > 1O 12 L ) at z ~ 2. We also make use of a comparison sample of galaxies 
with lower IR luminosities but with the same redshift and H-band magnitude distribution. Our vi- 
sual classifications of these two samples indicate that the fractions of objects with disk and spheroid 
morphologies are roughly the same but that there are significantly more mergers, interactions, and 
irregular galaxies among the ULIRGs (72+7% versus 32 ± 3%). The combination of disk and irregu- 
lar/interacting morphologies suggests that early stage interactions, minor mergers, and disk instabil- 
ities could play an important role in ULIRGs at z ~ 2. We compare these fractions with those of a 
z ~ 1 sample selected from GOODS-H and COSMOS across a wide luminosity range and find that 
the fraction of disks decreases systematically with Lir while the fraction of mergers and interactions 
increases, as has been observed locally. At comparable luminosities, the fraction of ULIRGs with 
various morphological classifications is similar at z ~ 2 and z ~ 1, though there are slightly fewer 
mergers and slightly more disks at higher redshift. We investigate the position of the z ~ 2 ULIRGs, 
along with 70 z ~ 2 LIRGs, on the specific star formation rate versus redshift plane, and find 52 
systems to be starbursts (i.e., they lie more than a factor of three above the main sequence relation). 
We find that many of these systems are clear interactions and mergers (~ 50%) compared to only 24% 
of systems on the main sequence relation. If irregular disks are included as potential minor mergers, 
then we find that up to ~ 73% of starbursts are involved in a merger or interaction at some level. 
Although the final coalescence of a major merger may not be required for the high luminosities of 
ULIRGs at z ~ 2 as is the case locally, the large fraction (50 — 73%) of interactions at all stages 
and potential minor mergers suggests that these processes contribute significantly to the high star 
formation rates of ULIRGs at z ~ 2. 

Subject headings: Galaxies: evolution - Galaxies: active - Galaxies: starburst - Infrared: galaxies 
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1. INTRODUCTION 

Since their initial discovery by IRAS, luminous and 
ultraluminous infrared galax ies ((U)LIRGs: Ltr > 
10 11 , 10 12 L Q - see review bv lSanders fc Mirabell (|1996f )) 
have been considered an important transition stage 
between gas-rich spir al galaxies and mas sive elliptical 
galaxies and quasars (San ders et all ir988). Studies of 
their morphologies showed that the infrared luminosity 
of such systems is correlated with their merger stage, 
such that lower luminosity LIRGs (Lir < 10 11,5 L@) are 
ordinary disks while higher luminosity LIRGs (Lir > 
10 115 Lq) are interactions and the highest luminosity 
systems (ULIRGs) are at an adv anced merging stage 
(jVeilleux et all l200l llshidal [2004). Since then, inves- 
tigations at higher redshift with Spitzer and now Her- 
schel have shown that while rare locally, (U) LIRGs were 
once much more common and even dominated the cos- 
mic star formation rate at z > 1 (e . g.JFloc'h et al.|[ 2005; 
ICaputi et all 120071: iMagnelli e t al. 2009; Mag nelli et all 
120111 : iBethermin et al-ll2011b iMurphv et al.ll2011f ). 

How do these high redshift (U) LIRGs compare to 
their local counterparts? Investigat i ons at z ~ 1 (e.g., 
jZheng et all l200i [BeTeT"aII [20051: feridge et al.l [20071 : 
IShi et al. 2006, 2009; Kartalte peet al.ll2010bD have found 
a similar trend between galaxy morphology and total 
infrared luminosity, with sources at the highest lumi- 
nosities dominated by major mergers. Initial attempts 
at even higher redshift (z ~ 2), at the peak of galaxy 
assembly, have so far proven difficult and such studies 
have been affected by small number statistics, morpho- 
logical /c-corrections, different methods for identifying 
galaxy mergers, and various selection effects. Morpho- 
logical analyses at z > 1 are difficult for several rea- 
sons. The effects of surface brightness dimming make 
objects very faint and identifying low surface brightness 
features ( such as tidal tails and de bris) becomes problem- 
atic (e.g.. iHibbard fc Vaccal [19971 ). In addition, at these 
redshifts optical images probe the rest-frame ultraviolet 
(UV) light in galaxies, which traces regions of active star 
formation but not the older stellar populations needed 
to discern the true structure of these galaxies and is sub- 
ject to heavy dust obscuration. Indeed, studies of local 
(U) LIRGs from the GOALS survey have found that ob- 
scuration is more pronounced in ULIRGs and that many 
double nuclei are hidden even in the rest-frame optical 
(though they appear in the NIR - Haan et a. 2011). 
Ideally, near infrared imaging is needed for a direct com- 
parison of z ~ 2 rest-frame optical morphologies with 
galaxies at lower redshift. However, prior to WFC3 on 
HST, this has only been possible for small numbers of ob- 
jects using adaptive optics from the ground or NICMOS 
on HST. 
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Such studies have found a wide range of results, from 
merg e rs being a dominant process (e.g., Dasvra et al.l 
12008b iBussmann et al.l 1201 it iZamoiski et al.l 120111) to 
mergers playing a fairly m inor role (e.g.. lMelbourne et al.l 
2009; iTargett et all 1201 ll ). The latter results have been 
intriguing because they suggest that there has been a 
shift in the driver of star formation in ULIRGs with red- 
shift. Such an idea has been supported by some nu- 
merical simulations that suggest that at high redshift, 
mergers are not necessary for the extreme luminosities 
of these systems but that instead such high star forma- 
tion rates can be maintained by the steady -state accre- 
tion of c old gas onto star f orming disks (e.g. JDekel et al.l 
2009aH; E ave et al.ll2010r >. In such a scenario, the mor- 
phologies of high redshift ULIRGs would appear as or- 
dinary disk-dominated systems or possibly as irregular 
"clumpy" disks. However, other simulations do lend 
support to the idea that mergers play a crucial role in 
high-redshift ULIRGs (e.g., Chakrabarti et al. 2008; 
Narayana n et al. 2009, 2010; Hop kins et al. 2010). A 
study by iRujopakarn et all (|2011h found that the spa- 
tial extent of star formation in high redshift ULIRGs 
more closely matches that of local LIRGs rather than 
the high concentrations observed in local ULIRGs, sup- 
porting the idea that the driver of these high star forma- 
tion rates evolves with redshift. To further complicate 
this picture, kinematic studies of BzK selected galaxies 
(that span a wide range of Ltr) have foun d evidence for 
disk- l ike rotation and c lumpy disks (e.g., iGenzel et al.l 
2008; iDaddi et al.ll2010l ) while similar studies of submil- 
limeter galaxies (typically more luminous than BzK se- 
lected galaxies) have found evidence for mergers (e.g., 
iTacconi et all 120081 : lEngel et al.ll2010f ). 

While these early results have been intriguing, discrep- 
ancies among the different studies have raised many ques- 
tions and a clear picture has not yet emerged. One rea- 
son for the differing results has been that each study 
has adopted a different definition and way of identifying 
galaxy mergers, including identifying separated pairs of 
galaxies based on redshift information (e.g., Bridge et al. 
2007), searching for merger signatures via visual classi- 
fication (e.g., Dasyra et al. 2008, Zamojski et al. 2011, 
Kartaltepe et al. 2010b), using various automated clas- 
sification methods (such as asymmetry [e.g., Conselice et 
al. 2003] and location on the Gini-M2o plane [e.g., Lotz 
et al. 2008]), and measures of galaxy kinematics (e.g., 
Genzel et al. 2008, Melbourne et al. 2009). Other stud- 
ies use automated measures, such as a galaxy's Sersic 
index, to infer the lack of galaxy mergers (e.g., Targett 
et al. 2011). All of these different methods for identifying 
mergers make the results from each of the various stud- 
ies difficult to directly compare to one another, at least 
in part because each one is sensitive to different merger 
stages and each has different levels of detectability as a 
function of redshift. In order to put together a clear pic- 
ture, the same criteria for identifying galaxy mergers is 
needed across a wide luminosity range for a large sample 
of objects. 

Another difficulty has been the selection effects of the 
various studies. In order to construct an unbiased sam- 
ple of ULIRGs at high redshift and therefore obtain a 
complete picture of their properties, one would ideally 
select objects based on data in the far-infrared (FIR) 
near the peak of emission. This has been possible for 
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small samples using MIPS 70 um imaging with Spitze r 
(e.g., iSvmeonidis et al.l 12008b iKartaltepe et al.l l2010af ). 
however, MIPS 70 /im lacked the sensitivity and reso- 
lution to detect anything but the most extreme sys- 
tems at z ~ 2, and by z ~ 2, no longer probes the 
peak of emis sion. Submillimeter s u rveys have als o been 
used (e.g., [C onseliceZeTaD 120031 : iTacconi et all 120081 : 
iSwinbank et al. 2010), but tend to pick out extreme sys- 
tems at high redshift with a bias toward cold dust emis- 
sion (jPope et all 12006( 1 . With a lack of deep FIR data, 
most of these previo us studies have had to rely on either 
24 /im selection (e.g.. iDasyra et all 12 0081 : [Zamoj ski et al.l 
120111 ). known to be biased toward obscured AGN (e.g., 
Sajina et al. 200 7) or various color selection techniques 
(BzK selection: llDaddi et al.l 12001 DOGs: iDev et al.l 
2008, etc.), which identify objects over a wide range of 
infrared luminosities, resulting in a muddled picture of 
the nature of high redshift ULIRGs. 

Without the addition of far-infrared data to the SED, 
it is difficult to determine an accurate total infrared lu- 
minosity. Several studies have shown that using 24 /im 
data alone and determining a total infrared luminosity 
based on locally-cal ibrated SED templates results in an 
over estimate (e.g., | Papovich et al.l 120071: iNordon et all 
120101 : lElbaz et alJl2010U2011l : IKartaltepe et al.ll2010al) at 
z ~ 2. Obtaining FIR-submillimeter data points for large 
samples of objects has previously been difficult and time 
consuming or has required the use of stacking. 

New surveys have completely changed this situa- 
tion. Deep far-infrared imaging is now available 
at 70-500 /im from the Herschel Space Observatory 
(jPilbratt et al.l I2010D. In particu lar, the GOODS- 
Herschel survey (|Elbaz et al.l 120111 ) has obtained the 
deepes t 100 and 160 /im ima ging with the PACS instru- 
ment (PoglitscheFal] 12010( 1 over the central region of 
the GOODS-S field. These data have allowed us to con- 
struct a complete, flux-limited sample of ULIRGs out 
to z ~ 3 with accurate total infrared luminosities for 
the first time. In addition, many LIRGs, though incom- 
plete, are detected over this redshift range. This field has 
been imaged in its entirety (along with several other deep 
fields) in the near- i nfrared using WFC 3 on HST by the 
CAN DELS survey (jGrogin et a,lJl2011b iKoekemoer et al.l 
120111 ), providing deep, high resolution, rest-frame optical 
imaging for our entire sample of high-redshift ULIRGs. 
Here, we analyze the morphologies of this complete sam- 
ple and discuss how the role of galaxy mergers in the for- 
mation of massive galaxies and their stars has changed 
over cosmic time. 

This paper is organized as follows: Section 2 introduces 
the two surveys used here and describes our sample se- 
lection. In Section 3 we present our comparison samples 
and Section 4 presents our visual classification scheme. 
We present our results in Section 5 and discuss their im- 
plications in Section 6. We summarize our findings in 
Section 7. Throughout this paper we assume a ACDM 
cosmology with Ho = 70kms _1 Mpc -1 , = 0.7, and 
Q m = 0.3. All magnitudes are in the AB system unless 
otherwise stated. 



2. OBSERVATIONS AND SAMPLE SELECTION 



The sample of high-redshift ULIRGs analyzed in this 
paper comes from observations of the Great Observato- 
ries Origins Deep Survey (GOODS) Southern field taken 
with the Herschel Space Observatory as a part of the 
GOODS- Herschel open time key program (GOODS-H: 
PI Elbaz). These observations cover a total area of 
10' x 10' with the deepest region (down to depths of 0.8 
and 2.4 mJy (3 a) at 100 and 160 /im, respectively) cover- 
ing 64 /2 . These observations are the deepest undertaken 
by Herschel to date. In addition, we use the GOODS-H 
observations of the GOODS-N field for the z ~ 1 compar- 
ison sample (see §3). These observations cover an area 
of 10' x 16' down to depths of 1.1 and 2.7 mJy (3a). A 
catalog of the flux densities and their uncertainties were 
obtained from point source fitting based on the known 
prior positions of MIPS 24 /im sources in the field, which 
were in turn extracted based on IRAC prior positions. 
For a detailed description of GOODS-H, t he sou rce ex- 
traction, and photometry, see Elb az et al.l (|2011f ). 

In addition to the GOODS-iJer.se/ie/ data, we make 
use of the full multiwavelength data set available for 
GOODS-S, i ncluding optical HS T- ACS images and 
photometry (Giavalisco et al. 2004), ground-based NIR 
photometry (RetzlaffeFaD 12010( 1. and IRAC (3.6- 
8.0 /im) and MIPS (2 4-70 jam) photometry from Spitzer 
(MagnellieTaI]|I3l|). We use a catalog of z-band se- 
lected objects with photometr y in all of the optica l-NIR 
bands measured using TFIT (Laidle£iES|2007). For 
each object detected at either 100 or 160 /im, we matched 
the IRAC position from the prior-based catalog to an op- 
tical counterpart in the TFIT catalog. For each source, 
we used a spectroscopic redshift if available (for 73% 
of the sources) and a photometric redshift derived us- 
ing SED template fit s to the UV-IR TFIT photometry 
for the rest (cf. iDahlen et aDl2010l ). We derived stel- 
lar masses for each (U)L IRG in our sample by fitting 
iBruzual fc Charlotl J2003) models to the observed data, 
fixing the redshift to either the spectroscopic or photo- 
metric redshift. The SED models span a large parame- 
ter space in terms of age, extinction, star formation his- 
tory, and metallicity. The masses were derived using a 
iChab rier (2003) initial mass function. 

We determined the total infrared luminosity, Lir 
(rest-frame 8-1000 /im), for each source by fitting 
the MIR-FIR SED (using the MIPS 24 /im, 70 /im 
[where available], PACS 100 /im and 160 /im pho- 
tometry) with several different template libraries 
(ICharv fc ElbaJ200lHDale fc Helo j 200a lLag ache et al.l 
120031 : ISiebenm orgen fc Kriigell \200% using the SED fit- 
ting code Le Phar^\ written by S. Arnouts and O. II- 
bert. For each template library, the best fit model was 
chosen by finding the one with the lowest x 2 value and 
allowing for rescaling of the templates. Each of the 
data points was weighted equally to avoid overweight- 
ing points with the smallest error bars (typically 24 /im). 
The total infrared luminosity was then calculated from 
the best fit template by integrating from 8-1000 /im. 
The vast majority of the sourc es were best fit using the 
Siebenmorgen & Kriigel (2007) library, which is based on 
radiative transfer models and spans a wide range of tem- 
plate shapes. We note that the value of Lir determined 
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Figure 1. Total infrared luminosity (Xir) as a function of redshift 
for all of the 100 and 160 jum detected galaxies in GOODS-S from 
the GOODS- Hers chel survey. The dividing luminosities for LIRGs 
(Lir > 10 11 L©) and ULIRGs (L m > 10 Y2 L©) are shown as the 
horizontal dashed lines and the dash-dot vertical lines highlight the 
redshift range of our sample (1.5 < z < 3.0). 

from the Sie benmorgen fc Kriigell (|2QQ7| ) library agrees 
well (within the associated error bars) with the value ob- 
tained using the other template libraries when allowing 
for rescaling (see discussion in Kartaltepe et al. 2010a). 

The final value of Lir is shown in Figure [I] as a func- 
tion of redshift with the LIRG and ULIRG luminosity 
limits highlighted. We selected an upper limit of z = 3 
since beyond this redshift, the ULIRG sample becomes 
incomplete and to ensure that the NIR imaging sam- 
ples the rest-frame optical light. The lower limit of 
z — 1.5 was chosen since there are very few ULIRGs 
at lower redshifts and to distinguish from our z ~ 1 
comparison sample. These limits select a complete sam- 
ple of 52 ULIRGs with (L m ) = 10 12 3 L© and (z) = 
2.2. Spectroscopic redshifts are avai lable for 45% of the 
sample in this red s hift range (from Szoko lvet al.l 1 2004: 
iKriek et all 120081; [Va nzella et al. 2008; Balestra et al. 
120101 : iSilverman etalll2010l : iFadda et al]l2QlQt Wuyts et 
al. 2009; Kurk et al., submitted). In addition to the 
ULIRG sample, there are 70 LIRGs over this same red- 
shift range with (L m ) = 1O 117 L and (z) = 1.8. The 
LIRG sample is incomplete and includes only the most 
luminous LIRGs at the high redshift end. 

2.2. CANDELS 

The Cosmic Assembly Near-Infrared Dark Energy 
Legacy Survey (CANDELS: Pis Faber & Ferguson; see 
Grogin et al. 2011 and Koekemoer et al. 2011) is an HST 
Mult i- Cycle Treasury Program to image portions of five 
different legacy fields (GOODS-N, GOODS-S, COSMOS, 
UDS, and EGS) with the Wide Field Camera 3 (WFC3) 
in the NIR. The survey is observing all five fields to 2- 
orbit depth in F125W (J-band, 2/3 orbit) and F160W 
(iLband, 4/3 orbits) and the central regions of GOODS- 
N and GOODS-S to 13 orbit depth in these bands as 
well as F105W ( F-band). For detail s on the full CAN- 
DELS survey, see lGrogin et al.l (j2011f ). In addition to the 
CANDELS observations, a portion of GOODS-S was also 



observed as a part of the Early Release Science (ERS) 
campaign in J and H. In this paper, we focus on the full 
(CANDELS+ERS) data set in GOODS-S. 

The CANDELS GOODS-S observations began in Oct. 
2010 and were completed in Feb. 2012. For this paper, 
we use the 2-orbit depth observations over the entire field 
for uniformity. The images were reduced and drizzled to 
a 0.06" pixel scale to create a full 2-orbit depth mosaic. 
T he details of the d a ta re duction pipeline are described 
in iKoekemoer et al.l (|2Qllh . The WFC3 photometry in 
both J and H bands were measured using SExtractor 
version 2.5.0 (jBertin fc Arnoutsl fl996) in a 'cold mode' 
setup (see, for example, Rix et al. 2004) found to work 
best for extracting z ~ 2 galaxies. All but one of the 
LIRGs and ULIRGs in our sample are detected in the 
CANDELS mosaics (one of the ULIRGs falls just off the 
edge so we remove it from our final sample). 

3. COMPARISON SAMPLES 

In order to put our results in an evolutionary context, 
we constructed two different comparison samples. One 
is a sample of less luminous galaxies at z ~ 2 and the 
other is a sample of LIRGs and ULIRGs at z ~ 1. Using 
the iLband selected SExtractor catalog, matched to the 
z-band selected TFIT and photometric redshift catalog, 
we selected a sample of objects with the same redshift 
and iLband magnitude distribution as the z ~ 2 ULIRG 
population. These objects are meant to sample the less 
luminous general z ~ 2 galaxy population. We excluded 
all Herschel detected galaxies from this comparison sam- 
ple to ensure that we are excluding ULIRGs and lumi- 
nous LIRGs down to the flux limits of the GOODS-H 
data (Lir < 1O 115 L ). We selected five galaxies for 
each ULIRG in our sample giving us a final comparison 
sample of 260 galaxies. The redshift and H magnitude 
distributions of the ULIRG and comparison samples are 
shown in Figure [2] It is important to note that this com- 
parison sample is not matched to the stellar mass distri- 
bution of the ULIRG sample since the ULIRGs dominate 
at the high mass end. There are not enough high mass 
galaxies at z ~ 2 that are not ULIRGs sinc e most mas- 
sive g alaxies at z ~ 2 are ULIRGs (see, e.g. JDaddi et al.l 
2005). The median stellar mass of the ULIRG sample 
is 10 10,8 Mq while the comparison sample has a median 
stellar mass of 10 10 4 M . 

For the sample of z ~ 1 (U) LIRGs, we combined the 
GOODS-H observations with a sample of M IPS 70 /xm se- 
lected galaxies from the C OSMOS survey (jScoville et al.l 
120071 : iFraver et all l2Q09h . The 70 /im selected sample 
contains 1503 galaxies across the entire COSMOS field 
with Lir measured using the same proced ure as the 
GOODS-H galaxies ([Kartaltepe et al.l l2Q10ah - For the 
z ~ 1 comparison, we selected all galaxies from these 
three fields (GOODS-N, GOODS-S, and COSMOS) with 
0.8 < z < 1.2, as illustrated in Figure [3l Since the 
GOODS-H data are deep but cover a small field and the 
COSMOS- 70 jam data are shallow but cover the large 
COSMOS field, by combining these data sets, we are 
able to sample over two orders of magnitude in luminos- 
ity, log(LiR/L ) = 10.6 — 12.9. This comparison sample 
allows us to look for trends across this full range in lu- 
minosity at a fixed redshift to compare with our results 
at z ~ 2. This sample contains a total of 569 galaxies, 
all with HST/ ACS optical imaging. 
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Figure 2. Redshift (left) and i!f-band magnitude (right) distribu- 
tions for the z ~ 2 ULIRG sample (filled histogram) along with 
the z ~ 2 comparison sample (unfilled histogram). 
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Figure 3. Total infrared luminosity (Lir) as a function of red- 
shift for the three different samples used as a comparison at 
z ~ 1. GOODS- Hers chel galaxies detected at 100 and 160 /xm 
in GOODS-N (black points) and GOODS-S (blue diamonds) along 
with the 70 /xm selected sample in the COSMOS field (Kartaltepe 
et al. 2010a, red squares). The dividing luminosities for LIRGs 
(L m > 10 n L o ), ULIRGs (Lir > 10 12 L©), and Hyperluminous 
Infrared Galaxies (HyLIRGs: Lir > 10 13 Lq) are shown as the 
horizontal dashed lines and the gray shaded area highlights the 
redshift range of the z ~ 1 comparison sample (0.8 < z < 1.2). 

All of the samples used in this paper are described in 
Table [T] along with their properties. 

4. MORPHOLOGIES 

4.1. Visual Classification Scheme 

To fully exploit this unique data set, the CANDELS 
collaboration has begun a large effort to visually classify 
all CANDELS galaxies down to a magnitude limit of H < 
24.5. Each galaxy in the full survey will be classified by 
multiple people to allow for comparisons among different 
classifiers. This task has already been completed for the 
GOODS-S 2 epoch mosaic covering the full field. Details 
of the full classification scheme as well as results and 



comparisons from GOODS-S with multiple classifiers will 
be discussed in J. S. Kartaltepe et al. (in preparation). 

All of the visual classifications are based primarily on 
the iJ-band WFC3 image, but the J-band image along 
with the V and /-band ACS images are included to pro- 
vide additional information and help with the classifica- 
tions. The shorter wavelengths are particularly useful 
for identifying galaxies with a clumpy structure in the 
rest-frame UV (see flags below). We have developed two 
GUIs (one web-based and the other perl based to inter- 
act with ds9) to allow for a uniform implementation of 
the classification scheme we have developed. There are 
four different components to the classification scheme. 

The Main Morphology Class: There are five differ- 
ent options to choose from here and more than one 
may be selected for each galaxy, permitting intermedi- 
ate cases to be indicated. The classes are as follows: 1) 
Disk: These are galaxies with a clear disk structure, 
whether or not they have spiral arms or a central bulge. 
2) Spheroid: These galaxies appear centrally concen- 
trated, smooth, and roughly round/ellipsoidal, regardless 
of their size, color, or apparent surface brightness. 3) Ir- 
regular/Peculiar: These include galaxies that do not 
easily fall into one of the other categories. This class 
is meant to indicate galaxies with irregular structure, 
regardless of surface brightness. This includes objects 
that are strongly disturbed, such as mergers (see Inter- 
action Classes below) but also includes disk or spheroids 
that have slightly disturbed morphologies. 4) Com- 
pact/Unresolved: These objects are either clear point 
sources, unresolved compact galaxies, or are so small that 
the internal structure cannot be discerned. 5) Unclas- 
sifiable: These objects are problematic and cannot be 
classified in any of the other main morphology classes, ei- 
ther because of a problem with the image (satellite trail, 
near bright galaxy, etc.) or because they are too faint for 
any structure to be seen. As noted above, these classes 
are not mutually exclusive because additional informa- 
tion can be gleaned by choosing more than one class. For 
example, choosing both disk and spheroid would identify 
galaxies with both a disk and bulge component. Choos- 
ing disk and irregular identifies objects where the disk 
is still visible but the morphology is slightly disturbed. 
For the purposes of this paper, we combine spheroids and 
compact objects into a single class for simplicity's sake 
and since very few objects fall into the latter category. 

The Interaction Class: There are four different options 
for interaction class and only one of the four (or none) 
can be selected. 1) Merger: These galaxies are single 
objects (including sources with double nuclei) that ap- 
pear to have undergone a merger by evidence of tidal fea- 
tures/structures such as tails, loops, or highly- irregular 
outer isophotes. 2) Interaction within SExtractor 
segmentation map: The primary galaxy appears to 
be interacting with a companion galaxy within the same 
iJ-band segmentation map. Interactions have clear sig- 
natures of tidal interaction; e.g., tidal arms, bridges, 
dual asymmetries, off-center isophotes, or otherwise dis- 
turbed morphologically - being apparent close pairs is 
not enough. To choose interaction over merger, two dis- 
tinct galaxies must be visible. 3) Interaction beyond 
SExtractor segmentation map: The primary galaxy 
appears to be interacting with a nearby galaxy that has 
its own distinct iJ-band segmentation map. By differen- 
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Figure 4. Sample HST-WFC3 F160W postage stamps of z ~ 2 
galaxies in each of the visual morphology classes. 



tiating between interactions within and beyond the seg- 
mentation map we can identify galaxies with possible de- 
blending problems. 4) Non-interacting companion: 
These galaxies have a close companion (in projection), 
yet no evidence of tidal interaction or disturbed mor- 
phology is apparent. For the purposes of this paper, we 
combine the two interaction categories into one and do 
not use the Non-interacting companion category. There- 
fore, the two interaction classes of interest are merger 
and interaction. 

In addition to the above, there are also Structural Flags 
and dumpiness Flags. Classifiers noted objects with the 
following properties that are relevant to the discussion 
in this paper: Tidal tails, double nuclei, asymmetric ob- 
jects, and degree of dumpiness. Further details on these 
flags and others will be discussed in J. S. Kartaltepe et 
al. (in preparation). 

All of the galaxies in the LIRG, ULIRG, and compar- 
ison samples (i.e., all of the samples discussed in this 
paper) were classified by JSK using the above scheme. 
Figure [4] shows a sample z ~ 2 object in each of the 
morphological categories. For the z ~ 2 galaxies, all 
of the (U)LIRGs and comparison sample galaxies were 
randomized before being classified, to avoid any poten- 
tial bias in the classifications. In addition, classifications 
were obtained for the z ~ 2 galaxies from the CANDELS 
team visual classification effort (for both the (U)LIRG 
and comparison sample), resulting in a total of 3 — 5 
classifications per galaxy (for galaxies with H < 24.5). 
For the z ~ 1 COSMOS (U)LIRGs, the classifications 
of Kartaltepe et al. (2010b), based on the F814W im- 
ages, were used and transformed to the above scheme, 
while the /-band ACS images of the GOODS-H z ~ 1 
(U)LIRGs were classified separately by JSK. By using 
the /-band images at z ~ 1 and the iJ-band images at 
z ~ 2, we ensure that we are probing each galaxy's struc- 
ture at approximately the same rest-frame wavelength. 

4.2. GALFIT 



In addition to the visual classifications for each galaxy, 
we also made quantitative measures of th e galaxy mor - 
phology using the GALFIT routine (jPeng et al.ll2QQ2h . 
GALFIT fits the two-dimensional galaxy light profile in 
an image using a y 2 minimization routine to estimate the 
best-fit Sersic profile of the galaxy. We used the iJ-band 
WFC3 image and a single PSF generated with TinyTim 
(Krist & Hook 2011) for all objects. Reliable fits (objects 
in noisy areas of the mosaic were not fit and we excluded 
objects with unrealistic parameters) were obtained for 
47/52 of the z ~ 2 ULIRGs and 201 galaxies in the com- 
parison sample. For all of these objects, we obtained 
measurements of the Sersic index (n, where n = 0.5 cor- 
responds to a Gaussian profile, n = 1 to an exponential 
profile, and n = 4 to a de Vaucouleurs profile) and the 
effective radius. 

5. RESULTS 

We compared the results of the different classifiers for 
each object in the ULIRG and comparison sample. For 
the most part, there is good agreement among the clas- 
sifiers. In particular, the classifiers agree whether some- 
thing is "normal" (i.e., a disk or a spheroid), or "dis- 
turbed" (irregular, or a merger /interaction). The highest 
level of disagreement appears to exist between whether 
an object is a disk, spheroid, or both. This disagreement 
occurs for low surface-brightness galaxies where the disk, 
if present, is very hard to see and distinguish from the 
bulge component. To understand these differences, we 
compared the visual classifications of JSK to the Ser- 
sic index measured using GALFIT for all of the objects 
in both the (U)LIRG and z ~ 2 comparison samples 
(Figure [5]). We find that objects classified as only disks 
tend to have lower Sersic indices ((n) = 1.3 ± 1.2) while 
those classified as only spheroids have higher indices 
((n) = 3.4 ± 1.8). Objects classified as both a disk and a 
spheroid have Sersic indices that match those classified 
only as spheroids ((n) = 3.4 ± 0.9) while those classified 
as irregulars tend to match the disks ((n) = 1.1 ±0.8). In 
order to test how well various classifiers agree and inter- 
pret the morphological scheme, we had everyone classify 
the same set of 200 galaxies so that we could identify 
outliers. We find that nearly all of the classifiers, includ- 
ing JSK, identify roughly the same number of objects as 
belonging to a particular morphological class, i.e., JSK 
is as likely as all of the others to classify an object as a 
given type. For the discussion and figures in this paper, 
we use the visual classifications of JSK and will discuss 
morphological comparisons among multiple classifiers in 
more detail in a future paper (J. S. Kartaltepe et al., in 
preparation). One advantage here is that by using the 
classifications by the same classifier for all of the samples 
in this paper, for CANDELS as well as the COSMOS 
z ~ 1 sources, we ensure that the all of the galaxies are 
classified in a uniform way. 

The main results of the visual classifications are shown 
in Figure [6] and a montage of iJ-band postage stamp im- 
ages of all of the z ~ 2 ULIRGs is shown in Figure 
Plotted in Figure [6] is the percentage of objects in the 
ULIRG sample as well as the z ~ 2 comparison sam- 
ple in each morphological class. Since the morphological 
classes are not mutually exclusive, the totals do not add 
up to 100%. The fractions of objects in both the ULIRG 
and comparison samples that are classified as disks or 
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Figure 5. Distribution of Sersic indices for the z ~ 2 ULIRG and 
comparison samples as measured using GALFIT on the F160W 
HST/WFC3 image. The histogram is color-coded by the main 
morphological class, divided by whether or not a spheroid or disk 
component is present. The objects classified as disks only have a 
mean Sersic index, (n) = 1.3±1.2 while those classified as spheroids 
only have a mean Sersic index, (n) = 3.4 ± 1.8, indicating that 
the visual classifications agree well with the results from GALFIT. 
Objects classified visually as having both a disk and a spheroid 
component have a Sersic index that is consistent with that of the 
spheroid only objects, (n) = 3.4 ± 1.9. The objects classified as 
irregulars have a mean Sersic index of (n) = 1.1 ± 0.8, closer to 
that of the disks. 

spheroids are approximately the same. At first glance, 
this would seem to indicate that at z ~ 2, ULIRGs are 
the same morphologically as galaxies with lower infrared 
luminosities. However, there are significant differences 
in the other categories, most strikingly in the 'Irregular' 
class (64t?% vs. 30 ±3%). The fraction of ULIRGs clas- 
sified as mergers is larger than in the comparison sample 
(1515% vs. 9l^%) but this difference may not be sig- 
nificant given the small number of objects. The fraction 
of ULIRGs classified as interactions (32l 6 %) is greater 

than the fraction of comparison galaxies (5l^%) by a fac- 
tor of six. The last category, 'Combined', encompasses 
all galaxies that could possibly be involved in an interac- 
tion or merger (the combination of mergers, interactions, 
and irregulars). In principle, all mergers and interactions 
would be classified as irregular as well, but in practice 
this does not always happen, particularly when the ob- 
ject is also classified as a disk or a spheroid. This is why 
the final 'combined' category contains a few more ob- 
jects than the 'Irregular' category. Here, the difference 
is quite large, 72^% for the ULIRGs versus 32 ± 3% 
for the comparison sample. It is intriguing that the dif- 
ference between the fractions classified as irregulars and 
interactions is large while the fraction classified as disks 
is roughly the same. This is an indication that many of 
the ULIRGs were classified as both, and that while an in- 
teraction or merger might be taking place, the disk is still 
present. This suggests that the role of minor and early 
stage mergers could play an important role in ULIRGs 
at z ~ 2. 

The effective radii determined from GALFIT for the 
ULIRGs and z ~ 2 comparison sample are shown in 
Figure [U The ULIRGs have radii which range from 
0.9 — 9.5 kpc with a median value of 3.3 kpc while the 



Figure 6. Percentage of z ~ 2 ULIRGs and comparison sample 
galaxies (260 lower luminosity systems at the same redshift) in 
each of the visual morphology classes. Note that since the classes 
are not mutually exclusive, the percentages do not sum to 100%. 
The combined category includes all objects classified as mergers, 
interactions, or irregulars. While the fraction of objects classified 
as disk or spheroid is about the same between the ULIRGs and 
comparison sample, the fraction of objects classified as irregular or 
interactions is significantly higher among the ULIRG sample. The 
error bars on each point reflect the 1 a binomial confidence limits 
given the number of objects in each category, following the method 
of Cameron et al. (2011). 

comparison sample ranges from 0.3 — 9.2 kpc with a me- 
dian of 2.5 kpc. A KS-test of the distributions shown 
in Figure [8] indicates that these two are not likely to be 
drawn from the same distribution (P = 0.006). This 
means that the ULIRGs are significantly more extended 
than the typical z ~ 2 galaxy population. A similar anal- 
ysis of the distribution of Sersic indices finds that the 
ULIRGs range from n = 0.2 - 6.8 with (n) = 1.8 ± 1.5 
and a median value of 1.4 while the comparison sample 
ranges from n = 0.2 — 7.8 with (n) = 2.2 ± 1.9 and a 
median of 1.4. A KS test of these two samples indicates 
that they are consistent with being drawn from the same 
population (P = 0.20). So while the ULIRGs are more 
spatially extended on average than the rest of the z ~ 2 
population, they have similar profiles and bulge to disk 
ratios. This result is consistent with the ULIRGs being 
more massive on average (and therefore larger) than the 
galaxies in the z ~ 2 comparison sample. 

6. DISCUSSION 

6.1. Evolution of ULIRG Morphology 

In the local universe, the merger fraction among IR 
galaxies increases systematically from ~ 10% for objects 
with log(L m /L ) = 10.5 - 11.0 to 100% for ULIRGs, 
with objects in between, LIRGs, presenting a wide 
range of morphologies from star- forming disks, to minor 
and maj or interacting systems, to more advanced merg- 
ers (e .g., Ishida 2004; Veilleux et al. 2002; Hwang et al.l 
120101 : lHaan et al.l 1201 ID . Most local ULIRGs are ad- 
vanced stage mergers with a single nucleus and this 
fraction is nearly 10 0% for the most luminous ULIRGs 
(jVeilleux et al.ll2002D . From this, it appears that the ex- 
treme environment of the final coalescence of a merger 
is necessary to produce these high luminosities in the 
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Figure 7. HST-WFC3 F160W postage stamps of our sample of 52 z ~ 2 ULIRGs. Each stamp is 5" x 5". 
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Figure 8. Distribution of the iif-band effective radii in kpc for the 
z ~ 2 ULIRG sample (shaded region) and the comparison sample 
(unshaded). The mean value for the ULIRGs is 3.7 db 1.7 kpc and 
2.8 db 2.0 kpc for the comparison sample. A KS-test of the two 
samples indicates that they are not likely to be drawn from the 
same distribution (P = 0.006). 

local universe. This is supported by numerical simu- 
lations that show that a galaxy's star formation rate 
peaks during the f inal coalescence of a mer ger (e.g., 
iMihos fc Hernauistl 119961 : iHopkins et all [2006). While 
the overall trend between merger fraction and infrared lu- 
minosity is similar at higher redshifts, ULIRGs at z ~ 1 
span a wider range of i nteraction stages, with roughly 
half at pre-coalescence (jKartaltepe et all l2Q10bh . From 
our results at z ~ 2, this trend seems to continue. For 
these objects, we have seen that only ~ 15% appear to 
be late stage mergers, while 57% are comprised of inter- 
actions and irregular morphologies. The large fraction 
of interactions implies that the first passage of a merger 
is sufficient to produce ULIRG luminosities, though not 
necessary since many show no signs of major mergers 
or interactions. In addition, given the large fraction of 
disks with irregular morphologies, it is possible that even 
minor mergers play a significant role in increasing the lu- 
minosity at z ~ 2. 

A comparison of z ~ 2 and z ~ 1 morphologies is 
shown in Figure Plotted is the fraction of objects 
in various morphological classes as a function of Ltr. 
The dotted points connected by colored lines come from 
the full z ~ 1 comparison sample and the stars repre- 
sent the z ~ 2 ULIRGs. Here, the visual morphologies 
are divided into mutually exclusive categories to identify 
trends. These categories are Non-interacting disks, con- 
taining all objects classified as disks but not as mergers or 
interactions, Pure Spheroids, containing all objects clas- 
sified as a spheroid but not as a disk (all objects classified 
as both are in the disk category), Irregular only, contain- 
ing all objects classified as irregular, but not as a disk, 
merger, or interaction, and All mergers and interactions, 
containing all objects classified as a merger or an inter- 
action. In addition, shown in black is the total fraction 
of objects classified as a merger, interaction, or irregular. 

A few trends in the z ~ 1 sample are clear. The frac- 
tion of non-interacting disks (shown in blue) decreases 
dramatically with infrared luminosity while the fraction 
of mergers and interactions increases. The fraction of 



Figure 9. Fraction of objects in each morphological class as a 
function of Ljr, for the z ~ 2 ULIRGs (stars) and the z ~ 1 com- 
parison sample (points). The fraction of objects classified as non- 
interacting disks decreases with Lir while the fraction of mergers 
and interactions increases. The fraction of spheroids and irregular 
only objects remains the same over the full luminosity range. The 
z ~ 2 ULIRGs have nearly the same fractions as the ULIRGs at 
z ~ 1, indicating that little evolution has occurred between these 
two redshifts. 

mergers and interactions among the z ~ 2 ULIRGs is 
slightly lower than at z ~ 1 while the fraction of non- 
interacting disks is slightly higher (at the ~ 2<j level) 
at the same IR luminosity. This suggests that there is 
slight evolution between z ~ 2 and z ~ 1 , in terms of the 
total fraction of objects involved in a merger or interac- 
tion, consistent with the evolution of the zero-point of 
the star-forming main sequence (see §6.3) between these 
redshifts. 

The fraction of objects classified as pure spheroids or 
irregular only is small, and remains roughly constant 
across the full luminosity /redshift range. It is interest- 
ing to note that the difference between the mergers and 
interactions trend (green) and the mergers, interactions, 
and irregular trend (black) is about the same across the 
entire luminosity range. The difference between these 
two represents objects that are classified as both disks 
and irregular, which possibly represent the contribution 
from minor mergers. If so, we note that their contribu- 
tion appears to be the same at all infrared luminosities. 
Within error, this difference also seems to be the same 
for z ~ 2 ULIRGs, although, as noted before, they seem 
to contribute less to the z ~ 2 comparison sample (with 
Lm < 10 11,5 ). An alternative possibility is that these 
irregular disks represent the 'clumpy' disks predicted by 
numerical simulations of cold flows (e.g., Bournaud & 
Elmegreen 2009; Dekel et al. 2009b). We note, how- 
ever, that although we cannot distinguish between these 
two possibilities using morphology alone, these irregular 
disks are more likely to be asymmetric than clumpy, as 
indicated by our structural flags described in §4. 

6.2. Comparison with Previous Results 

Early work on the light profiles of local ULIRGs sug- 
gested th at they followed a n elliptical-like r 1//4 pro- 
file (e.g., IWright et all [1990). A detailed analysis of 
the surface brightness profiles of all the single nucleus 
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systems from the IRAS 1 Jy Survey of ULIRGs by 
IVeilleux et "all (|2QQ2D found that they are best fit by r 1 / 4 
profiles and have me an half-light radii of 4.80 ± 1.37kpc. 
Veill eux et al.l (j2006[ ) found similar results with a NIC- 
MO S survey of 33 nearby ULIRGs - single nucleus sys- 
tems have elliptical-like radial profiles, though a few ob- 
jects have extended exponential disks. Since the single- 
nucleus systems in the local studies are all advanced stage 
mergers and have elliptical-like profiles, it follows that 
these are massive elliptical galaxies in formation. Our 
z ~ 2 ULIRG sample encompasses a range of morpholo- 
gies, however, and the GALFIT Sersic index measure- 
ments indicate that most of these systems have surface 
brightness profiles that more closely match exponential 
disks (n ~ 1). However, there is a tail of objects that 
extend to higher Sersic indices and are closer to being 
bulge-dominated. Four objects have n > 4. 

Since these z ~ 2 ULIRGs have a wide range of mor- 
phologies and include many early stage mergers, perhaps 
a better analog for comparison would be local LIRGs. 
LIRGs in the local universe span the full range of mor- 
phologies we see in the z ~ 2 ULIRG sample, including 
isolated disks, minor merger systems, and early stage 
interactions along with some more advanced stage merg- 
ers and elliptical-like systems. An analysis o f the LIRGs 
from the IRAS Bright Galaxy Sample (BGSi lSoifer et all 
1989) found that most LIRGs can be fit by a n = 1 pro- 
file Jlshida 2004) and that the light profiles can increase 
at large radii due to nearby companions. Other studies 
of objects from t he Great Observat ories All-Sky LIRG 
Survey (GOALS: Urmus et al.ll2009h find that the Sersic 
indices of local LIRG s cover a wide ran ge of values, from 
n ~ 1 to n ~ 4 (e.g., Haa n et all 120111 : D. C. Kim et al., 
in preparation). This result is similar to what we see for 
our z ~ 2 ULIRG sample. 

One of the first studies of ULIRG morphology at high 
redshift was conducted by iDasvra et al.l (|2008l ) using 
NICMOS imaging of 33 z ~ 2 ULIRGs selected from 
a 24 /im sample with additional color criteria to ensure 
they were at high redshift. They found that half of these 
systems were interactions and that they had disklike pro- 
files, consistent with the results found for our sample. 
These objects have a m ean effective rad i us of 2 .5 kpc and 
Sersic indices n < 1.35. IZamojski et al.l (|2011f ) expanded 
upon this sample of 33 by investigating the morphology 
of these plus an additional 101 flux-limited (^24 > 0.9 
mJy and m R > 20) LIRGs and ULIRGs at 0.5 < z < 2.8 
observed with IRS (Dasyra et al. 2009), using NIC- 
MOS imaging. For direct comparison with our z ~ 2 
ULIRG sample, we focus on their z > 1.5 subsample. 
This subsample contains 54 objects over a similar red- 
shift range but with higher luminosities than our sample 
(Lir > 10 12,5 L©, including a significant number with 
Lm > 1O 13 L ). They find that 50% of this sample 
are pairs or early stage mergers, 33% are advanced stage 
mergers, and 16% are merger remnants (including ellip- 
tical and point sources). That they find nearly all of 
the objects in this sample to be involved in a merger 
or interaction is striking, though perhaps not surprising 
considering the more extreme luminosity range covered 
by this sample, and the inclusion of ellipticals and point 
sources as merger remnants. Since we do not include el- 
lipticals and point sources, the 83% of objects classified 



as mergers represent a better comparison to our results. 
This fraction is larger than what we find for our sam- 
ple, but is consistent given the higher luminosity of the 
objects. They find that their sample typically has low 
Sersic indices (n < 2, though a mean value is not given) 
and that even during coalescence the mergers in their 
sample remain disk-dominated, possibly due to a higher 
gas fraction at z ~ 2. This is consistent with what we 
find for our z ~ 2 ULIRG sample, which also has a low 
mean Sersic index. 

Another selection of ULIRGs that has been studied at 
high redshift in the literature are the color selected Dust 
Obscured Galaxies (DOGs: f„(24 /am)/ f„(R) > 1000), 
Dey et al. 2008). Bussmann et al. (2009, 2011) con- 
ducted a detailed analysis of NICMOS images of 31 
Tower-law' DOGs (thought to be AGN dominated) and 
22 'Bump' DOGs (star formation dominated), respec- 
tively. They find that the Bump DOGs are larger than 
Power-law DOGs and tend to have more diffuse and 
irregular morphologies. The Power-law DOGs appear 
more relaxed than local ULIRGs and are split morpho- 
logically - half have regular and half have irregular mor- 
phologies. They have a mean Sersic index of (n) = 0.9 
with a range of 0.1 — 2.2, somewhat lower than in our 
z ~ 2 sample. The Bump DOG sample has a mean Sersic 
index of (n) =0.8, consistent with their being more dif- 
fuse and irregular than the Power-law DOGs, but again, 
lower than in our z ~ 2 ULIRG sample. Melbourne et al. 
(2009) used ground-based AO imaging of 15 z ~ 2 DOGs 
to investigate their morphology and found that eight 
were disks, four were ellipticals, two were unresolved, 
and one was diffuse. They found Sersic indices n < 2 
for nine of the objects and n > 3 for five of them. They 
found little evidence for merger activity among this sam- 
ple and concluded that the merger fraction among DOGs 
is lower than the general z ~ 2 ULIRG population. For 
comparison, if we look at the subset of our z ~ 2 ULIRGs 
that meet the DOG selection criteria (14 objects, see §6.4 
for more details), we find that 12 are classified as a disk, 
two as irregular, and one as a spheroid. Four of the disks 
are classified as having a spheroid as well, indicating a 
significant bulge component. Only two DOGs are clas- 
sified as interactions and none as mergers. The DOG 
subsample of our z ~ 2 ULIRGs does appear to have a 
lower merger fraction than the full ULIRG population, 
highlighting the selection effects present in color-selection 
techniques. The mean Sersic index of our DOG subsam- 
ple is (n) = 2.0, a bit higher than for the full sample. We 
note, however, that the DOGs studied by Bussmann et 
al. (2009, 2011) and Melbourne et al. (2009) have higher 
luminosities than the sub-set of our ULIRGs that meet 
the DOG criteria since they were selected from the wide, 
but relatively shallow, Bootes field. 

Using submillimeter surveys, Swinbank et al. (2010) 
studied the morphologies of 25 SMGs with NICMOS 
imaging over the redshift range (0.7 < z < 3.4) and found 
a mean Sersic index of (n) = 1.4 ± 0.8, similar to the 
value for our z ~ 2 ULIRG sample. Targett et al. (2011) 
looked at a sample of 15 SMGs using ground-based K- 
band imaging (above the 4000A break out to z = 4) and 
found little evidence for interactions among their sam- 
ple. They found that the morphology of their sample 
was closer to that of exponential disks with (n) = 1.44 
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with a median value of 1.08. On the other hand, stud- 
ies of the preponderance of close radio doubles amongst 
SMGs, and their gas dymamics, have found evidence for 
both early-stage an d maj or mergers (e.g.; Ilvison et al.l 
1200ft iTacconi et al.l [20081 : lEngel et aTll2QlQh . Addition- 
ally, a quantitative morphological analysis of 11 SMGs 
by Conselice et al. (2003) found that 61 ± 21% of them 
are major mergers. One possible explanation for the dis- 
crepancy among these different studies of SMGs is the 
difficulty in identifying signatures of interactions at high 
redshift from ground-based (non-AO) images. Also, the 
presence of an exponential disk light profile does not pre- 
clude the presence of mergers - indeed, our results show 
that many galaxies with merger morphologies have low 
Sersic indices. Though the Lir range covered by these 
studies is not given, SMGs in general tend to have more 
extreme luminosities than typical ULIRG samples. It is 
likely that our sample of z ~ 2 ULIRGs is a population 
that falls in between more extreme sources like SMGs 
and the samples of Dasyra et al. (2008) and Zamojski 
et al. (2011) and more moderate luminosity samples, 
such as those selected via the BzK color selection tech- 
nique typically studied at high redshift (e.g., Genzel et 
al. 2008). 

All of the previous studies at z ~ 2 described in 
this subsection have attempted to quantify the role of 
galaxy mergers among high redshift ULIRGs, and the 
wide range of results they have found has lead to some 
confusion about the nature of z ~ 2 ULIRGs. By com- 
paring these various studies to our z ~ 2 ULIRGs with 
an understanding of the different redshift and luminosity 
ranges sampled, we highlight some of the pitfalls of at- 
tributing properties of subsets of objects (such as DOGs 
or BzK selected galaxies) to the entire ULIRG popula- 
tion. This comparison has also highlighted some of the 
differences that arise from identifying mergers in differ- 
ent ways and using data of varying quality. All of these 
factors can make a direct comparison difficult. In gen- 
eral, the results from our morphological study of z ~ 2 
ULIRGs are consistent with these studies when compar- 
ing objects at the same luminosity, with the same color- 
selections and morphological properties, where possible. 
Although our ULIRG sample does not reach the same 
extreme luminosities that some of the previous studies 
do, it does sample more typical z ~ 2 ULIRGs that were 
common at that epoch. Our sample is the first complete, 
FIR selected sample of ULIRGs at z ~ 2 and is therefore 
unaffected by the biases of the previous studies. 

6.3. Role of Mergers Among Starbursts 

Many recent studies have found that a galaxy's star 
formation rate (SFR) and its stellar mass (M*) are 
tightly correlated and that the bulk of star form- 
ing galaxies follow a "main sequence" relation that 



2004; 



2007, 



evolves with redshift (e.g., [B rinchma nn et al.l 

jNoeske et aH 1200ft IPadcUet a l. 2007; Elbaz et al..._ 

1201 ll : iRodighiero et al 1 f20TTL Wuyts et al. 201 1|). Galax- 
ies with SFRs elevated significantly above this relation 
are considered to be starbursts. Here, we investigate 
the positions of z ~ 2 (U)LIRGs relative to the main 
sequence and look for differences among main sequence 
(U)LIRGs and starburst (U)LIRGs. We use the relation 
between a galaxy's specific star format i on ra te (sSFR) 
and redshift determined by lElbaz et al.l (j2011f ) to divide 



our sample into main sequence and starburst galaxies. 

Figure [10] plots the sSFR as a function of redshift in 
the left panels for the z ~ 2 (U)LIRGs, with their visual 
classifications split in different ways. We include the 70 
LIRGs in addition to the 52 ULIRGs in this analysis to 
span a wider range of SFR. We derive the star formation 
rate from the total infrared luminosity that we deter- 
mined from the template fitting in §2.1. The panels on 
the right show histograms of the 'starburstiness' param- 
eter, or the ratio between its sSFR and the sSFR value 
of the main sequence at that redshift, for each class. The 
top row includes all of the (U) LIRGs color coded in the 
same way as Figure [9l The galaxies are split into non- 
interacting disks, pure spheroids, irregular only, and all 
mergers and interactions. The color-coding is the same in 
the left and right hand panels. Here, we define starburst 
galaxies as those with sSFR greater than a factor of three 
above the main sequence (as indicated by the dashed 
lines). This results in 70 main sequence galaxies and 52 
starbursts among our z ~ 2 (U)LIRG sample. For the 52 
ULIRGs specifically, there are 28 main sequence galaxies 
and 24 starbursts. The exact dividing value used to sep- 
arate starburst from main sequence galaxies is somewhat 
arbitrary (factors of two and four have both been used 
in the literature). Here we are interested in trends in 
morphology as a function of the distance from the main 
sequence and adopt a factor of three for this comparison. 

The fractions of main sequence and starburst galaxies 
in each morphological class are given in Table El Non- 
interacting disks make up 57% of (U) LIRGs on the main 
sequence and 42% of those that are starbursts (note how- 
ever, that 50% of the starburst non-interacting disks are 
also irregular). On the other hand, mergers and interac- 
tions make up 24% of objects on the main sequence and 
50% of starbursts. All but one of the objects classified 
as only irregular are on the main sequence. The galaxies 
classified as pure spheroids are split between 11% of ob- 
jects on the main sequence and 6% of starbursts. We note 
that these numbers do not change significantly if we use 
a dividing value of a factor of four instead. In Table [2] 
we also list the mean and median values of the 'star- 
burstiness' for each morphological classification. Both 
the mean and median values for the mergers and inter- 
actions are elevated above the values for the other classes 
(with means of 6.6 for the mergers and interactions ver- 
sus 3.7 for the disks, 3.6 for the spheroids, and 1.7 for 
the irregulars). This shows that as a whole, the merg- 
ers and interactions have significantly elevated specific 
star formation rates relative to the other morphological 
classes. 

More can be learned by splitting up the visual clas- 
sifications in other ways. The middle row of Figure fTOl 
shows only those objects classified as disks or spheroids 
divided into three classes: pure disk, pure spheroid, or 
disk and spheroid, regardless of whether the objects are 
also classified as irregular, mergers, or interactions. This 
includes a total of 96 sources, 59 of which are main se- 
quence galaxies and 37 are starbursts. The remaining 
26 sources not included here are classified as irregular, 
merging, or interacting, but not as disks or spheroids. 
The fraction of objects classified as a pure disk is roughly 
the same for the main sequence and starburst galaxies 
(61 versus 75%). Spheroids make up 15% of the main 
sequence and 14% of the starbursts. Objects classified 
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Figure 10. Left Panels: Specific star formation rate (sSFR) as a function of redshif t for the z ~ 2 (U)LIRGs coded by their visual 
morphology. The solid line indicates the position of the star-forming main sequence (Elbaz et al. 2011) and the dashed lines indicate values 
of a factor of three above and below the main sequence. Galaxies with sSFR a factor of 3 above the main sequence are considered to be 
"starburst galaxies." Right Panels: Stacked histogram of the 'starburstiness' parameter: the ratio between the sSFR rate and the main 
sequence relation, color coded by visual morphology. The dashed line divides objects that are a factor of three above the main sequence 
relation and therefore starbursts. The three rows highlight various features of the morphological classifications. 
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as both disks and spheroids make up 14% of the main 
sequence and 11% of starbursts. These numbers indi- 
cate that the presence of a bulge is not correlated with 
whether or not a galaxy is on the main sequence - all 
three distributions are roughly the same. 

The bottom row of Figure [TO] shows only those objects 
classified as interactions, mergers, or irregular. If this 
subsample of objects is interpreted as the collection of 
all potentially merging systems, then dividing these ob- 
jects up in various ways will allow us to investigate the 
role of different kinds of mergers (e.g., minor versus ma- 
jor) among starburst and main sequence galaxies. All 
objects classified as disks or spheroids, but not irregu- 
lar, mergers, or interactions are excluded from this plot. 
These irregular /merging/inter acting objects are then di- 
vided based on whether or not they are classified as a 
disk at all. Those without a disk are then divided into 
interactions, mergers, irregular, or spheroid+irregular. 
Those that are classified as disks in this subsample are 
also classified as interactions, mergers, or irregular, but 
still show signs of the presence of a disk. We have sep- 
arated the objects based on the presence of a disk to 
examine the role that minor and early stage interactions 
might play. Of this subsample of objects (74 objects in 
total), only 11% of those on the main sequence are classi- 
fied as interactions (where the disk is no longer visible), 
a fraction which increases to 26% for starbursts galax- 
ies. Likewise, only 8% of the main sequence galaxies are 
mergers while in starbursts we identify 16%. This fac- 
tor of two difference between interactions and mergers on 
the main sequence versus starburst systems suggests that 
these potential major merger systems have an important 
impact on the energy output of starburst systems. 

Objects that are potential minor mergers or 
early stage interactions (i.e., disks AND irregu- 
lar/interactions/mergers) make up 61% of this subsam- 
ple on the main sequence and 53% of starbursts. This 
means that even minor and early stage interactions 
play an important role in starburst systems (though 
also important for galaxies on the main sequence). To 
understand the relative role of these two processes, early 
stage interactions (objects classified as interactions and 
disks) and possible minor mergers (objects classified as 
irregulars and disks, but not mergers or interactions) 
are shown separately in the bottom section of Table 2. 
Each of these categories has roughly equal numbers (22 
and 19, respectively) and each has roughly the same 
number in the main sequence (10 and 12, respectively) 
and starburst (9 and 10, respectively) categories. Only 
one object was classified as a disk and a merger and 
it is in the starburst category. Only three objects are 
classified as both spheroid and irregular. These are 
potential merger remnants and make up 6% and 3% of 
the main sequence and starburst categories, respectively. 

To summarize, 50% of starbursting (U)LIRGs are clear 
mergers and interactions. If all galaxies with irregu- 
lar classifications are included, this means up to 73% 
of starbursts are potentially involved in a merger or in- 
teraction at some level. Half of these objects are major 
interactions/mergers where the disk has been destroyed, 
while the other half are either minor mergers or early 
stage interactions where the disk is still present. This 
implies that both of these processes play an important 
role in starburst galaxies at z ~ 2. It is important to 
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Figure 11. Specific star formation rate (sSFR) as a function of 
redshift for the 17 (U)LIRGs that meet the DOG selection criteria, 
color coded by their visual morphology. The solid and dashed lines 
indicate the range of th e star-forming main sequence and starburst 
galaxies as in Fig. [TO] Only four of these objects are considered 
starbursts. The DOG subsample of (U)LIRGs is dominated by disk 
morphologies. 

note that mergers and interactions make up a significant 
fraction of main sequence galaxies as well. This is not 
surprising since the transition of a galaxy into a starburst 
system depends on several factors, including the merger 
timescale, progenitor mass ratios, and gas masses. Nu- 
merical simulations (e.g., Mihos & Hernquist 1996; Hop- 
kins et al. 2006) have shown that a during a merger, a 
galaxy's star formation rate is only expected to be sig- 
nificantly enhanced at particular phases of the merger 
(i.e., first passage and final coalescence). For the remain- 
der of the time, a galaxy undergoing a merger would be 
expected to lie on the main sequence. Th i s wou ld be 
consistent with the results of iHwang et al.l (j2011f ) who 
find that a galaxy's sSFR increases as it approaches a 
late- type neighbor. 

6.4. DOGs and AGN 

Here, we analyze the properties of (U)LIRGs in 
our sample that meet the DOG selection criteria 
(/„ (24/rni)//„(i?) > 1000)). Seventeen objects in our 
sample are DOGs and all but three of them are ULIRGs 
(see discussion of 14 ULIRG DOGs in §6.2). These ob- 
jects have (Lir) = 10 12 2 L© and (z) = 2.10. The posi- 
tion of these objects on the sSFR versus redshift diagram 
is plotted in Figure [TTJ color coded by their morpholo- 
gies. Almost all of these objects fall on the main sequence 
- only four can be considered starbursts - and for the 
most part, they consist of disk morphologies. One of the 
objects is a spheroid, two are irregular and three are in- 
teractions. The mean starburstiness value for the DOGs 
is 2.5. 

We also cross-matche d our (U)LIRG sample to the 
Chandra 4MS catalog (jXue et al.l l2011f ) to investigate 
the role of AGN among this sample. Figure [12] plots 
all of the (U)LIRGs with detections in the X-ray (30 
sources in total) coded by morphology on the left and 
X-ray luminosity on the right. Half of the objects are 
classified as disks while one quarter are classified each 
as spheroids and mergers/interactions. Almost all of 
the X-ray AGN lie on the main sequence, as seen for 
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Figure 12. Specific star formation rate (sSFR) as a function of 
redshift for the 30 X-ray detected (U) LIRGs, color coded by their 
visual morphology (left) and X-ray luminosity (right). The objects 
marked by boxes in the right panel are galaxies classified as star- 
forming only (non-AGN). The solid and dashed lines indicate the 
range o f th e star-forming main sequence and starburst galaxies as 
in Fig. [10] Five of the X-ray detected (U)LIRGs are starburst 
systems, four AGN (one spheroid and three interactions/mergers) 
and one non-AGN (an interaction / merg er) . 

the full X-ray selected population (Mullaney et al. 2011), 
with a mean starburstiness of 2.2. Two X-ray detected 
sources are classified as non-AGN, based on their x- 
ray emission being consistent with levels that can arise 
purely from star form ation (following the method of 
lAlexander et al.ll2QQ5f ). rather than AGN in the Chan- 
dra 4MS catalog and these are highlighted in the right 
panel. Of the five X-ray detected starburst systems 
(sSFR/sSFRms > 3), one is classified as a non-AGN 
(and is morphologically an interaction/merger), three are 
AGN with 42.5 < log(L x / 'erg s _1 cm" 2 ) < 43.0, and one 
is an AGN with 43.5 < log(L x / 'erg s _1 cm" 2 ) < 44.0. 
Four of the five starbursts are mergers or interactions. 
The mean starburstiness for the AGN classified as merg- 
ers or interactions is 3.4, which is significantly elevated 
above the value for the disks, 1.7. From this result, it 
would appear that AGN play a minor role in starburst- 
ing systems, but those AGN that are starbursts are dom- 
inated by mergers and interactions. While there are few 
AGN that appear to be starbursts, it is possible that 
starbursts host more obscured AGN that would not be 
detected in X-ray surveys. 

7. SUMMARY 

We have presented a complete volume limited sample 
of 52 z ~ 2 ULIRGs selected from the GOODS- Herschel 
survey with CANDELS WFC3 F160W imaging. These 
objects span a luminosity range of 10 12 L© < Lir < 
10 12,9 L© with a median luminosity of 10 12,3 L© over the 
redshift range, 1.5 < z < 3.0. This is the first complete 
sample of high-redshift far-infrared selected ULIRGs 
with high resolution near-infrared imaging. We have con- 
ducted a detailed morphological analysis of these objects 
along with high luminosity LIRGs (Lir > 10 11,3 L©) at 
the same redshift, a z ~ 2 comparison sample without 
Herschel emission and with the same redshift and H-band 
magnitude distribution as the ULIRGs, and a z ~ 1 com- 
parison sample spanning over two orders of magnitude in 
luminosity 10 10,7 < Lir/L© < 10 12,9 . From an investi- 
gation of the properties of these samples, the following 
conclusions can be drawn: 

1. Visual morphological classifications of the ULIRG 
sample using high resolution NIR imaging indi- 
cate that they have roughly the same fractions of 
disks and spheroids as the z ~ 2 comparison sam- 
ple (57% and 30%, respectively). However, there 



are significantly more ULIRGs classified as irreg- 
ular (72%) or interacting (32%). Over 70% of the 
ULIRG sample is classified as a merger, interaction, 
or irregular, compared to 32% of the comparison 
sample. Clear mergers and interactions make up 
47% of the sample - we consider this to be a lower 
limit since clear signatures are difficult to identify 
at this high redshift. 

2. At z ~ 1, galaxy morphology is tightly correlated 
with Lir, as has been observed locally. The frac- 
tion of objects classified as disks declines system- 
atically with luminosity while the fraction of merg- 
ers and interactions increases. The morphologies 
of the z ~ 2 ULIRGs have very similar fractions 
to objects at z ~ 1 with comparable luminosity at 
the same rest-frame wavelength, though there are 
slightly fewer mergers and interactions and slightly 
more disks at z ~ 2. This suggests that there 
has been a slight evolution in the morphology of 
ULIRGs between these two redshift s. 

3. We identify 52 z ~ 2 LIRGs and ULIRGs as star- 
bursts based on their elevated specific star forma- 
tion rates relative to the main sequence. Many of 
these starbursts are clear mergers and interactions 
(50%) while disks make up only 42%. Among these 
disks, many have irregular morphologies. It is pos- 
sible that the combination of objects classified as 
both disks and either irregular or interactions rep- 
resent early stage interactions and minor mergers. 
Taken together, up to 73% of starbursts could be 
interacting or merging at some level, with a signif- 
icant contribution from minor mergers. 

4. (U)LIRGs on the main sequence are dominated by 
non-interacting disks (57%) but a significant frac- 
tion are mergers or interactions (24%). This re- 
sult is expected, since simulations have shown that 
many mergers may never reach a starburst phase 
(especially if they lack the required gas densities) 
and those that do are only true starbursts for a frac- 
tion of the merger process. Many of the mergers 
and interactions we observe on the main sequence 
may be at an early stage and have not yet reached 
the starburst phase. 
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Table 1 

Samples Discussed in This Paper 




Name 


# Redshift Range log(Lm/ 'Lq) 


Relevant Figures 




GOODS-H + CANDELS Samples 


ULIRGs 

LIRGs 

(U)LIRGs a 


52 1.5-3.0 12.0-12.8 
70 1.5-3.0 11.3-11.9 
122 1.5-3.0 11.3-12.8 


1, 2, 5-12 
1, 10-12 
1, 10-12 


Comparison Samples 


z ~ 2 Comparison Sample 260 1.5 - 3.0 < 11.5 
z~l Comparison Sample 569 0.8 - 1.2 10.6 - 12.9 


2, 5, 8 
3, 9 



a This is a combination of the LIRG and ULIRG samples and are collectively referred 
to as (U)LIRGs throughout the paper. 



Table 2 

Percentage of (U) LIRGs in Each Morphological Class 



Morphological Classification Main Sequence Starbursts (sSFR/sSFRms) med(sSFR/ sSFRms) 

# % # % 



All (U)LIRGs - 122 objects 



Non-interacting Disks 


40 


57 


22 


42 


3.7 


2.4 


Non-interacting Spheroids 


8 


11 


3 


6 


3.6 


1.4 


Irregular Only 


5 


7 


1 


2 


1.7 


1.3 


All Mergers and Interactions 


17 


24 


26 


50 


6.6 


3.8 



All (U)LIRGs classified as disks or spheroids - 96 objects 



Pure Disk 


42 


71 


28 


75 


4.5 


2.6 


Pure Spheroid 


9 


15 


5 


14 


8.0 


2.3 


Disk and Spheroid 


8 


14 


4 


11 


2.0 


1.5 



All (U) LIRGs classified as irregular, mergers, or interactions - 74 objects 



Interactions, no disk 


4 


11 


10 


26 


6.6 


3.8 


Mergers, no disk 


3 


8 


6 


16 


4.3 


4.4 


Irregular Only 


5 


14 


1 


3 


1.6 


1.3 


Disk+Irr/Int/Mer 


22 


61 


20 


53 


4.4 


2.8 


Spheroid+Irregular 


2 


6 


1 


3 


5.9 


0.9 


All (U)LIRGs 


classified as 


irregular, 


mergers, or 


interactions AND disks 


- 42 objects 


Irregular Disks 


12 


55 


10 


50 


3.9 


2.8 


Interacting Disks 


10 


45 


9 


45 


4.7 


2.6 


Disk+Merger 








1 


5 


9.2 





